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THE SIGMA-SEQUENCE AND COUNTING OCCURRENCES OF SOME 
PATTERNS, SUBSEQUENCES AND SUBWORDS 


SERGEY KITAEV 


Abstract. We consid er sigm a-words, which are words used by Evdokimov in the construction 
of the sigma-sequence (Evdok |. We then find the number of occurrences of certain patterns and 
subwords in these words. 


1. Introduction and Background 


We write permutations as words tt = a±a 2 ■ ■ ■ a n , whose letters are distinct and usually consist of 
the integers 1,2 ,,n. 

An occurrence of a pattern p in a permutation 7 r is “classically” defined as a subsequence in 7 r (of 
the same length as the length of p) whose letters are in the same relative order as those in p. Formally 
speaking, for r < n, we say that a permutation a in the symmetric group S n has an occurrence of the 
pattern p 6 S r if there exist 1 < i\ < 12 < ■ ■ ■ < V < n such that p = cr(*i)cr(* 2 ) ... cr(i r ) in reduced 
form. The reduced form of a permutation a on a set {ji,j2, ■ ■ ■ ,jr}, where ji < J 2 < ■ • ■ < j r , is 
a permutation ay obtained by renaming the letters of the permutation er so that jt is renamed i 
for all i £ {1,... ,r}. For example, the reduced form of the permutation 3651 is 2431. The first 
case of classical patterns studied was that of permutations avoiding a pattern of length 3 in S3. 
Knuth [Knuthj found that, for any r £ S3, the number |S n (r)| of n-permutations avoiding r is C n , 
the nth Catalan number. Later, Simion and Schmidt SimSch] determined the number |S n (P)| of 
perm utations in S n simultaneously avoiding any given set of patterns P C S3. 

In [BabSteinj Babson and Steingrimsson introduced generalised permutation patterns that allow 
the requirement that two adjacent letters in a pattern must be adjacent in the permutation. In 
order to avoid confusion we write a ’’classical” pattern, say 231, as 2-3-1, and if we write, say 2-31, 
then we mean that if this pattern occurs in the permutation, then the letters in the permutation 
that correspond to 3 and 1 are adjacent. For example, the permutation 7 r = 516423 has only 
one occurrence of the pattern 2-31, namely the subword 564, whereas the pattern 2-3-1 occurs, in 
addition, in the subwords 562 and 563. A motivation for introducing these patterns in |BabSteinj 
was the stu dy of M ahonian statistics. A number of interesting results on generalised patterns were 
obtained in [Claes]. Relations to several well studied combinatorial structures, such as set partitions, 
Dyck paths, Motzkin paths and involutions, were shown there. 

Burstein [Burstein] considered words instead of permutations. In particular, he found the number 
| [k] n (P) | of words of length n in a fc-letter alph abet th at avoid all patterns fro m a set P C £3 simul¬ 
taneously. Burstein and Mansour [|BurMansl (resp. BurMans2, BurMans3|) considered forbidden 
patterns (resp. generalized patterns) with repeated letters. 

The most attention, in the papers on classical or generalized patterns, is paid to finding exact 
formulas and/or generating functions for the number of words or permutations avoiding, or having 
k occurrences of, certain patterns. In KitMansl the authors suggested another problem, namely 
counting the number of occurrences of certain patterns in certain words. These words were chosen 
to be the set of all finite approximations of a sequence generated by a morphism with certain 
restrictions. A motivation for this choice was the interest in studying classes of sequences and words 
that are defined by iterative schemes [Lothaire, |Salomaa . In [KitMans2] the authors also studied 
the number of occurrences of certain patterns in certain words. But there they choose these words 
to be the subdivision stages from which the Peano curve is obtained. The authors called these words 
the Peano words. The Peano curve was studied by the Italian mathematician Giuseppe Peano in 
1890 as an example of a continuous space filling curve. 
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In the present paper we consider the sigma-words , which are words used by Evdokimov in con¬ 
struction of the sigma-sequence [Evdok|. Evdokimov used this sequence to construct chains of 
maximal length in the n-dimensional unit cube. Independent interest to the sigma-sequence ap¬ 
pears in connection with the well-known Dragon curve , discovered by physicist John E. Heighway 
and defined as follows: we fold a sheet of paper in half, then fold in half again, and again, etc. 
and then unfold in such way that each crease created by the folding process is opened out into a 
90-degree angle. The “curve” refers to the shape of the partially unfolded paper as seen edge on. If 
one travels along the curve, some of the creases will represent turns to the left and others turns to 
the right. Now if 1 indicates a turn to the right, and 2 to the left, and we start travelling along the 
curve indicating the turns, we get the sigma-sequence [ Evdokimov|. In [Kitaev the sigma-sequence 
was studied from another point of view. It was proved there that this sequence cannot be defined 
by iterated morphism. 

Since the sigma-sequence w a is a sequence in a 2-letter alphabet, we consider patterns in 2- 
letter alphabets. Moreover, the patterns in a 1-letter alphabet (for example 1-1-1) correspond to 
two subsequences (for this example, these subsequences are 1 - 1-1 and 2 - 2 - 2 ), whereas the patterns 
in a 2 -letter alphabet (with at least one letter 2 ) uniquely determine the subsequences in w„ that 
correspond to them, and conversely. For example, an occurrence of the pattern 1-2-1 is an occurrence 
of the subsequence 1 - 2 - 1 , whereas an occurrence of the subsequence (subword) 211 is an occurrence 
of the pattern 211. Thus, any our result for a pattern, can be interpreted in term of subsequences 
or subwords, depending on the context, and conversely. 

In our paper we give either an explicit formula or recurrence relation for the number of occur¬ 
rences for some classes of patterns, subwords and subsequences in the sigma-words. In particular, 
Theorem [|, allows to find the number of occurrences of an arbitrary generalized pattern without 
internal dashes of length £, provided we know four certain numbers that can be easily calculated 
for the sigma-words Ck , Dk : Ck+i and Dk+i (to be defined below), where k = [log 2 £]. Theorem |] 
gives a recurrence relation for counting occurrences of patterns of the form T 1 -T 2 . In Section we 
discuss occurrences of patterns of the form T 1 -T 2 - • • • -Tk , where the pattern Tj does not overlap with 
the patterns r,;„i and r^i for i = 1,2,..., k — 1. Finally, Section [?| deals with patterns of the form 
[ti-t 2 ~ • • - -Tk], [ti-T 2 - • • - -Tk) and (ti-T 2 - • • •-Tfc] in Babson and Steingrfmsson notation, which means 
that we use ” [x” in a pattern p to indicate that in an occurrence of p, the letter corresponding to 
the x must be the first letter of a word under consideration, whereas if we use “y]”, we mean that 
the letter corresponding to y must be the last (rightmost) letter in the word. 


2. Preliminaries 


In Evdok, Yab|, Evdokimov constructed chains of maximal length in the n-dimensional unit 
cube using the sigma-sequence. The sigma-sequence w a was defined there by the following inductive 
scheme: 

Ci = 1 , Di = 2 
Ck+i = CfclDfc, Dk +1 = Ck^Dk 

k = 1 , 2 ,... 

and w a = lim Ck ■ Thus, the initial letters of w a are 11211221112212.... We call the words Ck 

k —>-oo 

the sigma words. The first four values of the sequence {Ck}k >l are 1, 112, 1121122,112112211122122. 

In [ Kitaev | an equivalent definition of w a was given: any natural number n yf 0 can be presented 
unambiguously as n = 2*(4s + a), where cr < 4, and t is the greatest natural number such that 2* 
divides n. If n runs through the natural numbers then a runs through some sequence consisting of 
1 and 3. If we substitute 3 by 2 in this sequence, we get w a . 

In this paper we count occurrences of patterns in the sigma-words, which are particular initial 
subwords of w a . However, the challenging question is to find the number of occurrences of patterns 
or subwords in an arbitrary initial subword of w „, or more generally, in a subword of w„ starting in 
the position i and ending in the position j. 

It turns out that for counting occurrences of certain patterns or subwords in C n , one needs to 
know the number of occurrences of certain patterns in D n . So, in the most cases, we give results for 
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both C n and D n . However, our main purpose is the words C n for n > 1, and in some propositions 
and examples we do not consider D n . 

In what follows, we give initial values for the words Ci and Dp. 

Ci = 1 Di = 2 

C 2 = 112 D 2 = 122 


C 3 = 1121122 D 3 = 1122122 

C 4 = 112112211122122 D 4 = 112112221122122 

C 5 = 1121122111221221112112221122122 D 5 = 1121122111221222112112221122122 

We now give some other definitions. 

A descent (resp. rise) in a word w = a 3 a 2 ... a n is an i such that a* > cq+i (resp. a* < a,i+ 1 ). 
It follows from the definitions that an occurrence of a descent (resp. rise) is an occurrence of the 
pattern 21 (resp. 12 ). 

Let c T n (resp. df) denote the number of occurrences of the pattern r in C n (resp. D n ). 

Suppose a word W = AaB, where A and B are some words of the same length, and a is a letter. 
We define the kernel of order k for the word W to be the subword consisting of the k — 1 rightmost 
letters of A, the letter a, and the k — 1 leftmost letters of B. We denote it by K-kiW). For example, 
£ 3 ( 111211221 ) = 12112. If |A| < k — 1 then we assume ICk(W) = e, that is, the kernel in this case 
is the empty word. Also, mk(r, W) denotes the number of occurrences of the pattern (or the word, 
or the subsequence depending on the context) r in ICk(W). 

We denote x-x - x (£ times) by x £ . Also, [a] denotes the least natural number b such that 

a < b. 


3. Patterns 1-1-1, 1-2 and 2-1 

It is easy to see that \C n \ = \D n \ = 2" — 1. The following lemma gives the number of the letters 
1 and 2 in C n and D n . 

Lemma 1. The number of Is (resp. 2s) in C n is 2 n ~ 1 (resp. 2 n ~ 1 — 1). The number of Is (resp. 
2s) in D n is 2 n ~ 1 — 1 (resp. 2 n ~ 1 ). 

Proof. It is enough to find the number of Is c n and d n in C n and D n respectively, since the number 
of 2s in C n and D n are obviously equal to \C n \ — c n and \D n \ — d n respectively. 

It is easy to see from the structure of C n and D n that 

J” Cn Cn — 1 T d n — 1 d“ 1, 

( dn — C n — 1 T d n — 1 , 

together with ci = 1 and c?i = 0. The solution to this recurrence is c n = 2 n ~ 1 and d n = 2 n ~ 1 — 1. □ 

Proposition 2. The number occurrences of the subsequence l fe (resp. 2 k ) in C n is ( 2 k ) (resp. 
( 2 fc -1 ))- Thus, the number of occurrences of the pattern l k in C n is equal to 
i k _ / 2 n ~ 1 - 1\ _ 2 n -k / 2 ra_1 - 1\ 

C ” ~ l k ) + { k )~2k )■ 

Proof. From Lemma [I], there are 2 n ~ 1 (resp. 2 n ~ 1 — 1) occurrences of the letter 1 (resp. 2) in C n , 
and thus there are ( 2 k ) (resp. ( 2 fe -1 )) occurrences of the subsequence l k (resp. 2 k ) there. □ 

Proposition 3. We have that for all n > 2, c!f 2 = dff 2 = 2 • 4 n 2 + (n — 2) • 2 n 2 , and 1 = 
^n _1 = 2 ‘ 4 n—2 n • 2 n ~ 2 . 

Proof. Let us first consider the pattern 1-2. An occurrence of this pattern in C n = C n -\lD n -i is 
either inside C n - 1 , or inside D n _ 1 , or the letter 1 is from the word C n - 1 1, whereas the letter 2 is 
from the word D n -\. Thus 

elf 2 = clf_ ± + dlf 2 x + { (the number of Is in C n - 1 ) + 1} ■ { the number of 2s in U„_ 1 }. 
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Using the same considerations for D n = C n -\2D n _\, one can get 

dn~ 2 = c„"_i + d]f_ i + { the number of Is in C n _ 1 } • { (the number of 2s in D„_ 1 ) + 1}. 

The number of Is and 2s in C n _ 1 and D n _ 1 is given in Lemma [l]. So, 

r c^ 2 = + 4-- 2 ! + 2” -2 • (2 n_2 + 1) 

l 4' 2 = 4- 2 r + 4'- 2 ! + 2 n_2 • (2 n_2 + 1) 

C n 2 \ = ( 1 1 \( C n~l ) ( 2™ -2 • (2 n_2 + 1) \ 

4" 2 / \ 1 1 ) V 4-i /V 2 — 2 - ( 2 — 2 + 1 ) ) 

together with C 2” 2 = 2 and d 2 " 2 = 2. Here, and several times in what follows, we need to solve 
recurrence relations of the form 

Xn — Ax n — 1 T 6, 

where A is a matrix, and x n , x n -\ and b are some vectors, where b sometimes depends on n. We 
recall from linear algebra that such relations can be solved by diagonalization of the matrix H, that 
is, by writing A = VDV~ 1 1 where D is a diagonal matrix consisting of eigenvalues of A , and the 
columns of V are eigenvectors of A. For example, if A is a 2 x 2 matrix that consists of Is, then we 
use 

(l 1\ = ( 1 1 0 0 W 1/2 -1/2 \ 

{1 1 ) V -1 1 7 V 0 2 ) V 1/2 1/2 ) 

for computing powers of A, and thus for solving the recurrence relations. For the recurrence we 
get that for all n > 2, c„ 2 = d^ 2 = 2 • 4 ra-2 + (n — 2) • 2 n-2 . 

In the same manner, we can get that for the pattern 2-1, 

c n~ 1 = C n-1. + 4-i + 2 n_2 • (2 n_2 — 1), 

dn 1 = ell, + dl~\ + 2" -2 • ( 2" _2 - 1 ), 

together with C 3" 1 = 2 and d | _1 = 2. This gives, that for all n > 2, c 2 " 1 = d 2 " 1 = 2A n ~ 2 —n-2 n ~ 2 . □ 

Proposition ^ shows that asymptotically, the numbers of occurrences of the patterns, or the 
subsequences, 1-2 and 2-1 in C n or D n are equal. 

4. Patterns without internal dashes 
R ecall the definitions in Section ||. 

Theorem 4. Let r = T 1 T 2 ... ti be an arbitrary generalized pattern without internal dashes that 
consists of Is and 2s. Suppose k = [log 2 £], a = m^{r 1 D^lCfc), and b = me(T, Dk2Ck). Then for 
n > k + 1 , we have 

c n = {a + b + c T k+1 + d T k+1 ) -2” k 2 - b, 
dn = (a + b + cl +1 + d k+1 ) -2” k 2 — a. 

Proof. Suppose n > k + 1. In this case, C n = C ra _ill) n _i = WiJCe(D k lCk)W 2 , for some words W\ 
and W '2 such that \W\\ = W 2 \■ Because of the definition of the kernel /Ce(D k lCk), an occurrence 
of the pattern r in C n is in either C n _ 1 , or D„_i, or K,k(D k lCk) (from the definitions |C ra _i H 
ICk(DklCk)\ = \D n _i fl ICk{DklCk)\ = £ — 1 and thus these intersections cannot be an occurrence 
of t). So, 

c n = c n -1 + d T n _ 1 + a, 

whereas in the same way, we can obtain that 

dn = C n- 1 + d T n _ l + b. 

By solving these recurrence relations, we get the desirable. □ 
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In particular, Theorem [| is valid for £ = 1, in which case the number of occurrences of r in C n 
(or D n ) is the number of letters in C n (or D n ). Indeed, in this case, k = 0, a = b = c[ = d\ = 1, 
hence c\ = d\ = 2 n — 1 = \C n \ = \D n \. Also, as a corollary to Theorem [| we have, that if 
a = b = c T k+1 = d k+1 = 0 for some pattern r, then this pattern never appears in sigma-sequence. 

All of the following examples are corollaries to Theorem [|. 

Example 5. Suppose r = 12. We have that k = 1, a = TO2(12, DilCi) = 0 and b = 777 2 (12, Di2C\) = 
0. Besides, c^ 2 = 1 and d\ 2 = 1- Thus using Theorem [|, for all n > 2, c 42 = 2 ra_2 . So, the number 
of rises in C n is equal to 2" -2 , for n > 2. 

If r = 21, again k = 1, but now a = m2(21, Di\C\) = 1 and b = m2(21, Di2C\) = 1. Besides, 
c 21 = 1 and d§ 4 = 1. From Theorem [|, for all n > 3, c 21 = 2 ra-2 — 1, which shows that the number 
of descents in C n is one less than the number of rises. 

Since in both cases a = b, using the recurrences in Theorem we have that c 42 = did 2 = 2 n ~ 2 , 
whereas c 21 = d 21 = 2 n_2 — 1. 

Example 6. Suppose r = 112 . We have that k = 2 , a = 7713(112, D 2 1 C 2 ) = 0 , and b = 
7773(112, D22C2) = 0 . Besides, dj 12 = 2 and d ^ 12 = 1 . Now, from Theorem [|, we have that for 
all ?7 > 3 , c “ 2 = d “ 2 = 3 • 2 n ~ 4 . 

Example 7. Suppose r = 221. We have that k = 2, a = 777 . 3 ( 221 , H 2 IC 2 ) = 1, and b = 
7773 ( 221 , D 22 C 2 ) = 1. Besides, c 221 = 0 and d 221 = 1. Now, from Theorem 0, we have that for 
all 77 > 3, c 221 = d 221 = 3 ■ 2 71-4 — 1. 

Example 8 . If r = 2212221 then k = 3, a = m 7 (221, .D 3 IC 3 ) = 0, b = m 7 (221, D 3 2C 3 ) = 1, 
£.2212221 _ aiK ] ^2212221 _ Q Thus for 77 > 4, C 2212221 = 2 n_4 — 1. 

5. Patterns of the form ti-t 2 

Theorem 9. Let p = ti-t 2 be a generalized pattern such that |ti| = k\ and |t 2 | = Suppose 
k = [log 2 (fci + fc 2 — 1)]. The following denote the number of occurrences of the subwords t± and r 2 
in the certain kernels: a Tl = mk^Ti, DfclCfc), a T2 = mk 2 (t 2 , DklCk), b Tl = 777.fc 1 (ri, Dk2Ck), and 
b T2 = 777fe 2 (r 2 , Dfc2Cfc). Also, let r“ ( resp. r%, r\, r 2 ) denote the number of occurrences of overlapping 
subwords T\ and r 2 in the word DklCk (resp. DklCk, Dk2Ck, Dk2Ck), where t\ € ^^(DklCk) 
and r 2 g Ck (resp. n G Dk and r 2 G ICk 2 (D k lCk), n G ICk 1 (D k 2Ck) and r 2 G Ck, n G D k and 
t 2 G ICk 2 (Dk2Ck)). Besides, we assume that we know c% and dfi for n > n,, i = 1,2. Tde77 for 
n > max(/c + 1, ni + 1,77 2 + 1), and dif are given by the following recurrence: 

( c; W 11 V c; - i ')4 a "V 

T 1 V 1 1 ) V 'll-, ) \ fin ) 

where 

a n = (cff—i + a Tl - rf)<Q_ 1 + (a T2 - r^cff^ 

and 

Pn = + b Tl - rj)d^_! + (b T2 - l' 2 ) c n- 1 - 

Proof. Suppose n > ma x(k + 1,77 1 + 1,77 2 + 1). Let us find a recurrence for the number cf (one can 
use the same considerations for df). 

An occurrence of the pattern r in C n = C n -ilD n _i is either inside C n _i, or inside D„_i, or 
begins in C n -\ or the letter 1 between C n -\ and D n _i and ends in Z)„_i or the letter 1. The first 
two cases obviously give c n -\ and d n -i occurrences of r. To count the contribution of the last to 
cases, we work with words instead of patterns. We do it to take in account the situations when t\ 
or r 2 consists of copies of only one letter. In this case, we cannot count occurrence of these patterns 
separately, and then use this information, since, for instance, occurrences of the pattern t\ = 111 
are subwords 111 and 222 (the last one of these subwords we do not need), whereas occurrences of 
the pattern n = 222 are not defined at all (222 is not a pattern). 

If an occurrence of ti-t 2 does not entirely belong to C n _ i or D n _ i then we only have one of the 
following possibilities: 

(a) the subword n entirely belongs to C n -\ and the subword r 2 entirely belongs to D„_i; 
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(b) the subword t\ belongs entirely to C„_i and the subword r 2 belongs to the kernel Ad 2 (-Dfcl Ck), 
where k = |dog 2 (fci-|-fc 2 —1)] is the least number that allow to control, in C n ( n > k), overlap¬ 
ping occurrences of subwords T\ and r 2 where 7 i is entirely from C n -\ and r 2 £ ICk 2 (-DfclCfc); 

(c) the subword r 2 belongs entirely to D n _\ and the subword n belongs to the kernel ( DklCf )■ 

In (a) we obviously have c ^ 1 _ 1 ■ cQ_ x possibilities. 

In (b) we have : • a T2 — 1 ■ rf possibilities, since we need to subtract those occurrences of 

T\ and r 2 that overlap. 

Analogically to (b), in (c) we have dff_ 1 -a Tl — d T p_ 1 -r“ possibilities, which completes the proof. □ 

Remark 10. For using Theorem one needs to know and d T n for patterns r without internal 
dashes. These numbers could be obtained by using Theorem |[ 

The following corollary to Theorem is straitforward to prove, using the fact that for non¬ 
overlapping patterns t\ and r 2 , r“ = r 2 = rj = r 2 = 0 . 

Corollary 11. We make the same assumptions as those in Theorem [^. Suppose additionally that 
the words t± and r 2 are not overlapping in the following sense: no one suffix of t\ is a prefix of 
r 2 . Then for n > max(fc -f 1, n\ + 1, n 2 + 1), cf and df are given by the same recurrence as that in 
Theorem [ij with 

a n ~ (, c n -1 + a ri)d n _i + a T2 c n _ 1 

and 

Pn = ( c n-l "b b Tl )d r f _ 1 + b T 2 C r f_ l . 

Remark 12. Corollary |ll] is valid under more weak assumptions, namely we only need the property 
of non-overlapping of the patterns t± and t 2 when one of them is in its kernel and the other one is 
not in its kernel. Example [l5| deals with the pattern r that has overlapping blocks n and r 2 , but 
Corollary [Tl| can be applied. However, from practical point of view, checking the fact if two subwords 
are non-overlapping is more easy than considering the kernels and checking the non-overlapping of 
the subwords there. 

Example 13. Suppose r = 12-21. We have that |ri| = |r 2 | = 2. Now, in the statement of Theorem|] 
we have that k = 2 , a Tl = 0, a T2 = 1, b Tl =0 and b T2 = 1. Also, since there are no overlapping 
occurrences of the subwords 12 and 21 in ^(1221112) and ^ 3 ( 1222112 ), we have r“ = 0, r 2 = 0, 
r\ = 0 and r 2 = 0. Besides, from example |], cff = dff = 2 n ~ 2 and cff = dff = 2 n ~ 2 — 1. Thus, 
a n = ft n = 4" -3 . Using the fact that c| 2 " 21 = 0 and d| 2 " 21 = 1, this allows us to get an explicit 
formula for c^ 2 ' 21 and d^ 2 ' 21 for n > 3: 

cl 2 " 21 = dl 2 ' 21 = h n ~ 2 - 3 • 2 n ~ 4 . 

In particular C 4 2 " 21 = 5. 

Example 14. Suppose r = 1-221. We have that |ti| = 1 and |t 2 | = 3. Moreover, the words t\ 
and r 2 are not overlapping, hence we can use Corollary [TT|. We have that k = 2, a T1 = 1, a T2 = 1, 
b T1 = 0 and b T2 = 1. From example |^, d 221 = 3 • 2 71-4 — 1. Also, the number of occurrences of the 
letter 1 (the subword t\ = 1) is given by Lemma |]: c\ = 2 n ~ 1 . So, a„ = 6 • 4" -4 + 3 • 2 n ~ 5 — 1 and 
P n = 6 • 4” -4 . One can get now an explicit formula for c 4-221 and d 4-221 for n > 4: 

c 4 ' 224 = i4 n " 2 + 27 • 2 " -5 - n - 7, 

d 4 " 224 = i4 "“ 2 + 21 ■ 2 71-5 - 8 . 

In particular, df 221 = 47. 

Example 15. Suppose r = 112-21. We have that |ti| = ki =3 and |r 2 | = ^’2 = 2. The other- 
parameters in Theorem O are k = 3, a Tl = 0, a T2 = 1, b Tl = 0, b T2 = 1 , rf = r% = r\ = r\ = 0. From 
Example for n > 4, c„ 2 = 3 • 2 n-4 , and from Example [|, d 24 = 2 n_2 — 1. Thus, in Theorem ^j, 
a n = Pn = c 4 ^ 1 (d 21 _ 1 + 1) = 3 • 4" -4 . Now, we solve the recurrence relation from the theorem to 
get, that for n > 3 

112-21 _ J112-21 _ ^ /in— 3 _ nn -4 
c n ~ u n — o ' ^ z 
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6. Counting occurrences of t 1 -t 2 - r k 

In this section we study the number of occurrences of a pattern r = t\ -t 2 - ■ • • -T k . where Tj 
are patterns without internal dashes. We say that r consists of k blocks. We assume that for 
i = 1, 2,..., k — 1, the pattern does not overlap with the patterns and Tj+i. In this case 
we give a recurrence relation for the number of occurrences of r, provided we know the number 
of occurrences of certain patterns consisting of less than, or equal to, k — 1 blocks, as well as 2 k 
certain numbers which can be calculated by considering the words D^lCe and Cf2D p, where t is 
the maximum number such that t < maxi[log 2 |Tj|"|. The cases of k = 1 and k = 2 are studied 
in the previous sections; they give the bases for our calculations. However, the case of overlapping 
patterns is not solved, and it remains as a challenging problem, since an answer to this problem 
gives the way to count occurrences of an arbitrary generalized pattern, or an arbitrary subsequence, 
in cr-words. 


Theorem 16. Let r = t\-t 2 - ■ ■ ■ -r k be a generalized pattern such that |rj| = ki for i = 1,2,..., fe. 
We assume that for i = 1, 2,..., k — 1, the subword Ti does not overlap with the subwords Ti_i and 
Ti+i in the following sense: no one suffix of is a prefix of Ti and no one suffix of Ti is a prefix 
of Ti + i. Suppose ti = [log 2 kf\, l = max,; ti, and for the subwords Ti we have a, = m ki (rj, D^ i 1) 
and bi = m ki (t^, D^LlC/if), for i = 1,2 ,k. We assume that we know cffjf Ti and Tk for 

each 1 < i < k — 1 and for all n > n*. Then for all n > max(f + 1, n* + 1), c f n and dare given by 
the following recurrence: 



'* . d Ti+1 ~ 

a n -1 

'■ r k • 

)+e( 

ai 

1 

1 

1 

• d Ti+1 ~ 

a n -1 

•• T k 

+ 1 

■■-Tk 


v bi- 

n — -Ti-! 
c n- 1 

• d Ti+1 ~■■ 

a n- 1 

■~Tk 


Proof. We consider only c^, since the same arguments can be applied to df. We use the considera¬ 
tions similar to those in Theorem ||. 

An occurrence of the pattern r in C n = C n -\lD n -i can be entirely in C n or D n . The first term 
counts such occurrences. Otherwise, we have two possibilities: either the letter 1 between the words 
Cn-i and D n _i does not belong to an occurrence of r, or it does do it, in which case there exist 
i (exactly one) such that the subword r,; occurs in its kernel. The first sum in the statement is 
obviously responsible for the first of this cases, whereas the second sum is responsible for the second 
case (in the last case we use the fact that subwords Ti are not overlapping). □ 


As a corollary to Theorem HI w e have Corollary 

The following example is another corollary to Theorem [Ti~i[ 


Example 17. Suppose r = 2 — 1 — 221, that is, ti = 2, r 2 = 1 and T3 = 221. So, parameters in 
Theorem [l6| are the following: fci = fc 2 = 1, k% = 3, t\ = t 2 = 1, £3 = 2, t = 2. From D\lCi = 211 
we obtain ai = 0, a 2 = 1- From D 2 IC 2 = 1221112 we obtain 0,3 = 1. From Di2Ci = 221 we get 
61 = 1, b 2 = 0. From D 22 C 2 = 1222112 we get 63 = 1. Besides, from Proposition ||, Examples [?] 
and [b|, we have 

cff-^ = c 2 - 1 = 2 • 4 n ~ 2 - n ■ 2 n ~ 2 , for n > 1; 
d T n 3 = 4 21 = 3 • 2 n-4 - 1, for n > 3; 


dff- T3 = 4" 221 = | • 4 n-2 + 21 • 2 n ~ 5 - 8, for n > 4. 

Also, the number of occurrences of the subword ti = 2 in C n is given by Proposition ^]: off = c 2 = 
2 n_1 — 1. So, the number of occurrences of the pattern r in C n and D n , for n > 5, satisfies the 
following recurrence relation: 




+ 


5 on 25— 3n a n 171 on _i_ n 

1024 8 + 256 4 64 ^ 


x n—1 


1024 c 


+ 


256 

21 —3n /yn _ 

256 


with initial conditions C5 = 70 and = 74. 
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7. Patterns of the form [ti-t 2 -r fc ], [ti-t 2 - r k ) and (ti-t 2 - r k ] 


We recall that according to Babson and Steingrimsson notation for generalized patterns, if we 
use ”[” in a pattern, for example if we write p = [1-2), we indicate that in an occurrence of p, the 
letter corresponding to the 1 must be the first letter of a word under consideration, whereas if we 
write, say, p = (1-2], then the letter corresponding to 2 must be the last (rightmost) letter of the 
word. 

In the theorems of this section, we assume that we can find the numbers cff ~ T2 Tk and dff ~ T2 Tk 
for any patterns Ti, i = 1,2,..., fe, without internal dashes. For certain special cases, these numbers 
can be obtained using the theorems of Sections || and |h[ 


Theorem 18. Suppose t\ and t 2 are two patterns without internal dashes such that |ri| = k\ and 
\t 2 \ = k 2 . Also, suppose £\ = log 2 (fci + 1), £ 2 = log 2 (fc 2 + 1) and £ = log 2 (fci +k 2 + 1). Let a(ri, t 2 ) 
be the number of overlapping subwords t\ and t 2 in Ce such that t\ occurs as k\ leftmost letters 
of C e ; &(ti,t 2 ) is the number of overlapping subwords ri and t 2 in Ce such that t 2 occurs as k 2 
rightmost letters of Ce- We assume that we know cff and dff for i = 1,2 and for all n > n*. 


i. 

For 

n 

> 

max(!i, 

n* 

0, 









r hi-r 2 ) 

J 

f 


if Ce x does not begin 

with n, 





'll 


1 

l C 

- a(n,T 2 ), 

otherwise. 


ii. 

For 

n 

> 

max(f 2 , 

n * 

0, 









4 T1 

-1-2] 




if Ce 2 does not end with r 2 , 








l of; 

- b(n,T 2 ), 

otherwise. 


iii. 

For 

n 

> 

t. 













r 

0, 

if Ce does not begin with ti or end 

! with t 2 





r LTl-T 2 j _ 

= \ 











\ 

1, 

otherwise. 



iv. 

For 

n 

> 

max(fi, 

n * 

), 









i Ti - T2 > 

= | 

' 0, 


if De x does not begin 

with ri, 







1 

k dn 2 

- a(Ti,r 2 ), 

otherwise. 


V. 

For 

n 

> 

max(f 2 , 

n* 










4 ri ' 



f °> 


if De 2 does not end 

with t 2 , 





-r 2 ] 

= 










l « 

- b(n,T 2 ), 

otherwise. 



vi. For n > £, 


Jri ~t 2 ] 
u n 


0, if De does not begin with ti or end with r 2 , 
1, otherwise. 


Proof. We prove case i, all the other cases are then easy to see. 

Clearly, if Ce 1 does not begin with ti then C n does not begin with ti for all n> £ i, which means 
that elf 1 T2 ' > = 0 in this case. Otherwise, to count occurrences of the pattern [ti-t 2 ) is the same as 
to find the number of occurrences of the pattern r 2 in C n and then subtract the number of such 
occurrences of r 2 that begin from the i-th letter of C n , where 1 < i < k\. □ 

The following two examples are corollaries to Theorem El- 

Example 19. Suppose we have the patterns eri = [1122 — 21211) and a 2 = (21221 — 12]. From 
Theorem [l^, off = dff = 0, since C3 does not begin with 1122 (£1 = 3). Also, off = dff = 0, since 
C3 does not end with 12 (i 2 = 3). 


Example 20. Suppose r = [112-21). We have that k\ = 3, £\ = 2 and C 2 begins with the subword 
112. Besides, a(112, 21) = 1 and, from Example ||, c^ 1 = = 2 n ~ 2 — 1. Theorem [l^ now gives, 

that for n > 3, we have c^ 12 21 ^ = cff — a(ri, r 2 ) = 2 n_2 — 2. 
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The following theorem is straitforward to prove using the assumptions concerning non-overlapping 
of certain subwords. 

Theorem 21. Let {ti,t 2 , • ■ ■ ,T k } be a set of generalized patterns without internal dashes. Suppose 
In| = si, |r fc | = s k , 4 = log 2 (si + 1) and 4 = log 2 (s fc + 1). Also, i = max(4,4). 

i. With the assumption that the subword t\ does not overlap with the subword r 2 , that is, no 
one suffix of t± is a prefix of t- 2 , we have 
(a) 

0, if Ci ± does not begin with t\, 


„[ti - T 2-Tfc) 


r2 -r 3 -Tfc 


(b) 


0 , 


, otherwise. 


if Di x does not begin with n, 


^[n-r 2 -r fc ) _ 

[ dff~ T3 Tk , otherwise. 

ii. With assumption that the subword r k -i does not overlap with the subword r k , that is, no 
one suffix of r k -1 is a prefix of r k , we have 
(a) 

0, if Ce k does not end with r k , 


-Tfc] 


T1-T2- Tk -1 


(b) 


0, 


otherwise. 


if Df, k does not end with r k , 


^(Tl-T2-Tfc] _ 

[ dm T2 Tfc - 1 ; otherwise. 

iii. With the assumption that the subword ti does not overlap with the subword T 2 , and the 
subword r k -i does not overlap with the subword r k , we have 
(a) 

0, if Ci does not begin with t\ or does not end with r k , 


,[ti-T 2 -Tfc] 


(b) 

d [Tk-Tl-.-Tk] = 


c r 2 r 3 ••• T k otherwise. 


0, if De does not begin with t\ or does not end with r k , 

dl? T3 Tfc ~ 1 , otherwise. 


The following example is a corollary to Theorem |2l| . 

Example 22. Suppose r = [112-1-221-22]. The parameters of Theorem are ki = 3, = 2, 

^i = 2, ^2 = 2,£ = 2. Cs begins with the subword 112 and ends with the sub word 22. Thus by 
Theorem P and Example ITl 4 112 - 1 - 221 - 22 ! = c i- 22 i = 1^-2 + 27 . 2 n-5 _ n _ 7 . 
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